ABSTRACT

Both common factor analysis and components analysis are useful techniques 
for reducing the number of variables in a data set or for identifying 
underlying covariance structures that exist among a set of variables.
Although researchers have for some years debated the appropriateness of
selecting one of these methods over the other, components analysis has
traditionally been the dominant strategy among educational researchers.

Following a brief, simplified overview of the logic of factor analysis, a
review of a number of studies comparing the common factor and principal 
components methods is presented. Actual educational research data are used to 
demonstrate cases in which the two methods will produce different results. In 
addition, guidelines are offered to aid the researcher in determining which 
method to employ in specific research situations. 
COMMON FACTOR ANALYSIS OR COMPONENTS ANALYSIS:
AN UPDATE ON AN OLD DEBATE 

Factor analysis is an elegant statistical technique that may be used by 
the social scientist both in theory development and in the validation of 
measures of human behavior and abilities. Consequently, factor analysis has 
been described as "one of the most powerful tools yet devised for the study of 
complex areas of behavioral scientific concern" (Kerlinger, 1986, p. 689), and 
as "the furthest logical development and reigning queen of the correlational 
methods" (Cattell, 1978, p. 4). Factor analytic methods allow the researcher 
to reduce a set of observed variables to a smaller set of latent variables 
expressing common dimensions of the observed variable set. Nunnally (1967, p.
289) noted that this data reductive property of factor analysis "is useful in
the explication of constructs." Likewise, Harman (1967, p. 4) noted, "The
chief aim [of factor analytic methods] is to attain scientific parsimony or 
economy of description." 

Cattell (1978, p. 5) has proposed a two-pronged application of factor 
analysis in the behavioral sciences: 

One important use of factor analysis is in finding the 
"significant dimensions". . .in a jungle of variables. . . .A 
second major service of factor analysis is in exposing and 
counteracting that vice of the human mind which. . .assumes that 
when there is one word [for a given construct] there must be one 
thing corresponding to it in the real world. 

Despite the general agreement among social science researchers (e.g.,
Bookstein, 1990; Cattell, 1952, 1978; Gorsuch, 1983, 1990; Harman, 1967;
McArdle, 1990; Mulaik, 1972, 1990; Nunnally, 1967; Steiger, 1990; Velicer &
Jackson, 1990) that factor analysis is an important research tool, there is
considerable controversy regarding the appropriateness of various techniques 
for determining factor solutions. More specifically, the controversy centers 
around whether principal components analysis (PCA) or common factor analysis 
(CFA) is the most appropriate technique to use in research situations 
employing structural factor analysis. 

The purpose of the present study is to review some of the arguments of 
the proponents of PCA and CFA, and to offer some guidelines for the 
appropriate choice of method in various research situations. A brief overview
of the logic underlying factor analysis along with a simple heuristic example 
is provided for the novice reader. A review of various empirical studies 
which have compared PCA and CFA is presented. Finally, several factor 
analyses utilizing actual educational research data are presented to 
demonstrate cases in which the two methods will produce similar and different 
results. 

The Logic of Factor Analysis — A Brief Overview 

Social scientists (e.g., Cattell, 1952, 1988a; Rummel, 1970) have 
conceived of a three-dimensional model (often referred to as a "data box" or 
"data cube") for measuring and describing any given psychological or 
ideological phenomenon. The three dimensions (called modes) which constitute 
this model are generally considered to be persons, variables, and occasions of 
measurement (Cattell, 1952). Factor analytic techniques usually involve two
of these three modes, one of which is factored across the other. 

In conducting an "R-technique" factor analysis, the most commonly used
factor analytic technique, the researcher first selects a finite set of p
variables from a universe of possible variables designed to measure a specific
construct. The choice of variables may be based on theoretical considerations
or on the researcher's own hypothetical notions regarding the nature of the
construct. Data are collected on these variables from a sample of persons
thought to be representative of the researcher's population of interest. A
square (p x p) matrix of association (correlation matrix) is constructed to
determine the intercorrelations among the p variables. On the basis of these
correlations, a new rectangular correlation matrix is constructed with the p
variables serving as the rows and m common factors serving as the columns.

The researcher's goal in R-technique factor analysis is to reproduce as 
much of the variance in the original set of p variables as possible in m 
interpretable factors (m < p). By examining the content of the variables
which correlate most highly with each factor (determined by consulting the
factor structure coefficients or "loadings" of each variable with each of the
factors), the researcher then attempts to give a name to each of the m
interpretable factors underlying the original variable set. If the 
researcher's goal is theory development, the resultant factors serve as 
indicators of the various dimensions underlying the construct the researcher 
is endeavoring to measure. If the goal is construct validation of a 
psychometric instrument, the factors indicate the various constructs the 
instrument is purported to measure. 
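
The two matrices described above can be sketched in a few lines of code. The following is an illustration only, not the procedure used in the present study: it takes the responses listed in Table 1 for Items 1, 5, 6, and 7 (so p = 4), builds the square p x p correlation matrix, and derives a rectangular p x m matrix of structure coefficients from the first principal component (m = 1). The choice of items, the use of NumPy, and all variable names are assumptions made for the example.

```python
import numpy as np

# Responses of the 20 cases in Table 1 to Items 1, 5, 6, and 7.
item1 = [2, 2, 2, 1, 2, 1, 2, 2, 2, 2, 2, 1, 2, 2, 2, 2, 2, 1, 2, 1]
item5 = [2, 1, 2, 1, 2, 2, 2, 1, 2, 2, 2, 1, 2, 1, 2, 2, 1, 1, 2, 1]
item6 = [1, 2, 1, 2, 1, 1, 1, 2, 2, 2, 2, 2, 1, 2, 1, 1, 2, 2, 2, 2]
item7 = [1, 2, 2, 1, 2, 2, 2, 1, 1, 1, 1, 1, 2, 1, 2, 2, 1, 1, 1, 1]
data = np.array([item1, item5, item6, item7], dtype=float).T  # cases x p

p = data.shape[1]
m = 1  # number of factors retained (m < p); an illustrative choice

# Step 1: square (p x p) matrix of association among the variables.
R = np.corrcoef(data, rowvar=False)

# Step 2: rectangular (p x m) matrix of structure coefficients ("loadings").
eigvals, eigvecs = np.linalg.eigh(R)   # eigenvalues in ascending order
order = np.argsort(eigvals)[::-1]      # reorder so the largest comes first
eigvals, eigvecs = eigvals[order], eigvecs[:, order]
loadings = eigvecs[:, :m] * np.sqrt(eigvals[:m])

print(np.round(R, 5))
print(loadings.shape)
```

The off-diagonal entries of R computed this way agree with the corresponding entries of Table 2 (for example, the correlation between Items 1 and 5).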

In "Q-technique" factor analysis, the most commonly-used alternative to 
R, the same two dimensions (variables and people) are used as in R-technique 
although they are reversed (Comrey, 1973), i.e., the people are factored 
across the variables, yielding "person factors," or clusters of persons who 
behave or think differently than persons in other clusters in terms of the 
constructs being measured. Although the present study is limited to
discussion of the R-technique factor analysis, detailed discussions of the
logic underlying Q-technique are provided elsewhere (Kerlinger, 1986;
Stephenson, 1953). A number of research studies in which Q-technique factor
analytic methods have been employed are also available for further study by 
the interested reader (e.g., Carr, 1989; Daniel, 1989b; Thompson, 1980; 
Townsend, 1987). 

As a heuristic example of the logic underlying R-technique factor 
analysis, consider a hypothetical research situation involving 10 variables
collected on each of 20 subjects, as presented in Table 1. The square
inter-item correlation matrix of association for these 10 variables is presented in
Table 2, and the rectangular "factor solution" matrix is presented in Table 3.



INSERT TABLES 1, 2, AND 3 ABOUT HERE

An analysis of the factor structure coefficients of the variables with the
identified factors as shown in Table 3 indicates that two distinct factors are 
being measured, with Items 1, 5, 6, 7, and 9 correlating highly with Factor I, 
and the remaining items correlating highly with Factor II. In an actual 
research situation, the next two logical steps would be to examine the items 
identified with each factor, and then to name the factors. 

Methods for Determining Factor Solutions

The most difficult (and perhaps the most subjective) part of conducting a 
factor analysis is determining the factor solution. Indeed, some have argued 
that factor analysis is so subjective that it should be used with extreme 
caution. For instance, Armstrong (1967) in an amusing article subtitled "Tom
Swift and His Electric Factor Analysis Machine" warned of the danger of
routinely using exploratory factor analysis to drive theory development in
lieu of forming a priori theoretical assumptions about the relationships
among variables. However, as explained by Daniel (1989a), these exploratory
factor analytic methods can be followed up with confirmatory methods which can 
directly take into account the a priori relationships among the variables. 

At least three processes are involved in determining a factor solution: 
(a) selecting from among several mathematical models for determining the
values of factor structure coefficients, (b) determining if and how the factor 
results should be rotated, and (c) deciding on the number of factors to 
extract and interpret. In the present study, only the first of these three 
processes is considered. However, a considerable amount of writing has also 
been devoted to the issues of factor rotation (e.g., Cattell, 1978; Gorsuch, 
1988; McDonald, 1985; Mulaik, 1972; Nunnally, 1978) and number of factors 
(e.g., Cattell, 1966; Kaiser, 1960; Linn, 1968; Zwick & Velicer, 1982, 1986). 

Historically, there has been considerable controversy regarding the 
appropriateness of various techniques for determining factor solutions. 
Spearman's early ideas about mental abilities led to his development of the "G 
factor" theory (Spearman, 1904), i.e., the notion that all mental abilities 
are ultimately explained by a single general or G factor. Nunnally (1967, p. 
334) has explained the G factor theory as follows: 

The general factor was thought of as a type of mental yardstick 
of intelligence, and only one yardstick was thought necessary to 
explain the common ground among all forms of individual 
differences in abilities. Thus tests as diverse as tests of 
arithmetic, spelling, and the judgment of illusions were thought 
to share in G. In addition, it was theorized that each source of
individual differences (test) possessed a unique factor. . . .
The theory is sometimes called Spearman's two-factor theory,
because it hypothesized that each test could be explained by a
general factor and a unique factor. (emphasis in original)

Holzinger extended the G factor theory in his development of the
"bi-factor" solution (Holzinger, 1941; Holzinger & Swineford, 1937). This
theory hypothesizes that the common variance among a series of tests (or 
variables) can be accounted for by both a general factor and by two or more 
"group" factors. Hence, all of the group factors correlate with the G factor, 
but not with one another. Nunnally (1967, p. 339) provides a simplified
example of this factor solution. 

By the 1930s, statisticians were beginning to develop the more
sophisticated factor analytic models used most frequently today. Hotelling 
(1936) proposed methods for calculating the principal components solution. 
Principal components analysis (PCA) yields linear combinations of observed 
variables based upon a referent square matrix of correlation among the 
variables with unity values on the matrix diagonal. PCA assumes that the 
inter-variable correlation matrix produced by the data from a given sample 
perfectly reflects the population correlation matrix. In other words, PCA 
assumes that the variables have been collected without error (Gorsuch, 1988). 
The analysis yields n principal components from a set of n variables (i.e., 
the number of components equals the number of variables). Generally, the 
researcher selects from among these n factors the first several which account 
for a major portion of the variance. The selected factors are interpreted and 
named, whereas factors with negligible contributions to the variance are 
ignored. The result is a "truncated" component solution (Gorsuch, 1988).
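
As a rough sketch of the principal components logic just described, the code below factors a correlation matrix with unities on the diagonal, confirms that n components are obtained for n variables, and then keeps only the first k of them. The correlations are those reported in Table 2 for Items 1, 5, 6, and 7; the subset of items, the value k = 2, and the variable names are assumptions chosen for illustration.

```python
import numpy as np

# Correlations among Items 1, 5, 6, and 7 as reported in Table 2,
# with unities on the diagonal (the PCA starting point).
R = np.array([
    [ 1.00000,  0.47140, -0.23570,  0.23570],
    [ 0.47140,  1.00000, -0.66667,  0.45833],
    [-0.23570, -0.66667,  1.00000, -0.79167],
    [ 0.23570,  0.45833, -0.79167,  1.00000],
])
n = R.shape[0]

eigvals, eigvecs = np.linalg.eigh(R)   # ascending order
order = np.argsort(eigvals)[::-1]      # largest eigenvalue first
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

# Full solution: n components for n variables, reproducing R exactly.
full = eigvecs * np.sqrt(eigvals)
assert np.allclose(full @ full.T, R)

# Proportion of the total variance (n, the trace of R) per component.
proportion = eigvals / n

# "Truncated" solution: retain only the first k components.
k = 2  # illustrative choice, not a recommendation
truncated = full[:, :k]

print(np.round(proportion, 3))
print(truncated.shape)
```

The full set of components reproduces the correlation matrix without residual, which is the sense in which PCA treats the variables as measured without error; truncation simply discards the columns that contribute little variance.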

In common factor analysis (CFA), the unity diagonal values on the inter-
variable correlation matrix are replaced with communality (h²) estimates of
each variable. As noted by Gorsuch (1978, p. 24), variable communalities are
the correlations of each variable with the "common elements" underlying the
variable set. One frequently used mathematical analog for determining lower-
bound estimates of the communalities is to compute the squared multiple R of
each variable with all the other variables. Since error variance is included
in any data set, the communality estimates are less than unity in value.
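
The squared multiple correlation estimate can be obtained directly from the inverse of the correlation matrix, using the standard identity SMC_i = 1 - 1/(R^-1)_ii. The sketch below applies it to the Table 2 correlations for Items 1, 5, 6, and 7; the item subset and the variable names are assumptions made for the example.

```python
import numpy as np

# Correlations among Items 1, 5, 6, and 7 as reported in Table 2.
R = np.array([
    [ 1.00000,  0.47140, -0.23570,  0.23570],
    [ 0.47140,  1.00000, -0.66667,  0.45833],
    [-0.23570, -0.66667,  1.00000, -0.79167],
    [ 0.23570,  0.45833, -0.79167,  1.00000],
])

# Squared multiple correlation of each variable with all the others.
smc = 1.0 - 1.0 / np.diag(np.linalg.inv(R))

# "Reduced" correlation matrix: unities replaced by communality estimates.
R_reduced = R.copy()
np.fill_diagonal(R_reduced, smc)

print(np.round(smc, 3))
```

Each estimate falls below unity, so the reduced matrix carries less variance on its diagonal than the matrix analyzed in PCA.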

CFA involves the computation of both common and unique factors. For n
variables, there will be n unique factors that explain variance unique to each
variable. CFA generally allows the researcher to extract k common factors for
the n variables such that 1 ≤ k ≤ n. As noted by Cureton and D'Agostino
(1983), the number of common factors is generally less than half the number of
variables (k < n/2). As each factor is extracted, a resultant "residual"
matrix is calculated. This matrix expresses the relationships among the
variables with the effect of the first factor removed from the analysis. In
the "pure" mathematical model of CFA, factors are extracted until "some
statistical test says the last residual matrix is essentially full of zeros"
(Cattell, 1978, p. 30).
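
The residual matrix can be illustrated with a single extraction step. The sketch below is a simplified, noniterated principal axis step and is not the SPSSx routine used later in this study: it places squared multiple correlations on the diagonal, extracts the first common factor, and subtracts that factor's contribution. The data are the Table 2 correlations for Items 1, 5, 6, and 7; the item subset and names are assumptions.

```python
import numpy as np

# Correlations among Items 1, 5, 6, and 7 as reported in Table 2.
R = np.array([
    [ 1.00000,  0.47140, -0.23570,  0.23570],
    [ 0.47140,  1.00000, -0.66667,  0.45833],
    [-0.23570, -0.66667,  1.00000, -0.79167],
    [ 0.23570,  0.45833, -0.79167,  1.00000],
])

# Reduced matrix: communality estimates (SMCs) replace the unities.
R_reduced = R.copy()
np.fill_diagonal(R_reduced, 1.0 - 1.0 / np.diag(np.linalg.inv(R)))

# First common factor: largest eigenvalue and its eigenvector.
eigvals, eigvecs = np.linalg.eigh(R_reduced)   # ascending order
first_vector = eigvecs[:, -1]
loading = first_vector * np.sqrt(eigvals[-1])

# Residual matrix: what remains once the first factor is removed.
residual = R_reduced - np.outer(loading, loading)

print(np.round(residual, 3))
```

Repeating the same step on each successive residual matrix yields further factors, and extraction stops when the residuals are judged to be negligible.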

As previously noted, the controversy surrounding various factor analytic 
methods has yielded many diverse opinions of the viability of one method over 
another. Fairly early in the development of CFA and PCA, Cureton (1939)
responded to the ongoing debates among the "two-factorists," "bi-factorists,"
"multiple-factorists," and "component analysts": 

Factor theory may be defined as mathematical rationalization. A
factor-analyst is an individual with a peculiar obsession
regarding the nature of mental ability or personality. By the
application of higher mathematics to wishful thinking, he always
proves that his original fixed idea or compulsion was right or 
necessary. In the process he usually proves that all other 
factor-analysts are dangerously insane, and that the only 
salvation for them is to undergo his own brand of analysis in 
order that the true essence of their maladies may be discovered.
(p. 287)

Interestingly, by 1967, one researcher stated, "The heated and inspired 
controversies about the 'best' method of factor analysis are over" (Harman,
1967, p. 9). However, even though the old G factor and bi-factor theories are 
now outdated, the controversy over common factor analysis and principal 
components analysis is very much alive. In fact, Bentler and Kano (1990) 
recently noted that the controversy is approximately 50 years old. 

One reason for the controversy is the affinitive relationship between the 
two methods. Gorsuch (1990, p. 33) notes the similarity and distinction 
between the two: 

Common factor analysis is the general case of which component 
analysis is a special case. Common factor analysis includes 
variables with error and variables without error because, in the
latter case, certain elements become zeros. Component analysis 
limits this broader model by an additional assumption: the 
variables are reproduced without error (i.e., without 
uniquenesses or residuals). 

According to Acito and Anderson (1980) and Gorsuch (1983), principal
components analysis is most useful when the number of variables is large and
when variable communalities are relatively high (above 0.40) and homogeneous
in value. As demonstrated by Gorsuch (1983), Harris (1970), and Velicer 
(1974, 1976, 1977), when these two assumptions are met, PCA and CFA yield 
nearly identical results. "However, with a smaller number of variables, the 
values in the diagonal assume greater importance and the loadings in the 
resulting factor pattern [using PCA] may tend to be inflated" (Acito & 
Anderson, 1980, p. 230). 
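
The role of the diagonal with a small variable set can be seen by factoring the same correlations twice, once with unities and once with communality estimates on the diagonal. The sketch below uses the Table 2 correlations for Items 1, 5, 6, and 7 and a noniterated principal axis step with squared multiple correlations; both the item subset and this simplified extraction are assumptions made for illustration.

```python
import numpy as np

# Correlations among Items 1, 5, 6, and 7 as reported in Table 2.
R = np.array([
    [ 1.00000,  0.47140, -0.23570,  0.23570],
    [ 0.47140,  1.00000, -0.66667,  0.45833],
    [-0.23570, -0.66667,  1.00000, -0.79167],
    [ 0.23570,  0.45833, -0.79167,  1.00000],
])

def first_factor(matrix):
    """Loadings on the largest-eigenvalue factor of a symmetric matrix."""
    eigvals, eigvecs = np.linalg.eigh(matrix)   # ascending order
    vector = eigvecs[:, -1]
    if vector.sum() < 0:                        # the sign is arbitrary
        vector = -vector
    return vector * np.sqrt(eigvals[-1])

# PCA: unities on the diagonal.
pca = first_factor(R)

# Common factor step: squared multiple correlations on the diagonal.
R_reduced = R.copy()
np.fill_diagonal(R_reduced, 1.0 - 1.0 / np.diag(np.linalg.inv(R)))
paf = first_factor(R_reduced)

print(np.round(pca, 3))
print(np.round(paf, 3))
print(np.sum(pca ** 2) >= np.sum(paf ** 2))
```

Because the reduced matrix differs from R only by a smaller diagonal, its largest eigenvalue cannot exceed that of R, so the first component's sum of squared loadings is at least as large as the first common factor's.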

Considering the problems associated with PCA, many have suggested that 
PCA should not be routinely used as a factor analytic procedure (e.g., 
Cureton, 1939; Cattell, 1988b; Gorsuch, 1990; Harman, 1967; McDonald, 1985; 
Mulaik, 1990). Cattell (1988b), for instance, notes that for PCA to be 
viable, the set of variables under consideration "would have to lie in a 
completely self-explanatory subuniverse, self-sufficient as a system entirely 
isolated from the rest of the universe. The components model must be 
considered a mere mathematical figment" (p. 134). Similarly, McArdle (1990, 
p. 81) notes that "PCA is a poor substitute for a more complete and reliable 
CFA." Despite these sentiments, PCA remains the preferred method in 
behavioral research (Pruzek & Rabinowitz, 1981). This preference may be due 
in part to the setting of PCA as the default factor extraction method in many 
popular statistical software packages such as SPSSX (Hubbard & Allen, 1987). 

In opposition to the many writers who prefer CFA, there are a number of 
researchers who adamantly defend the viability of PCA. Chief among the 
defenders of PCA are Velicer and his associates. Velicer has sought through 
results of a host of empirical studies to show that PCA and CFA rarely produce 
notably different results (e.g., Velicer, 1974, 1976, 1977; Velicer & 
Jackson, 1990). Wilkinson (1989) concurs with the findings of Velicer, and
further notes that most attempts to discredit the merits of PCA focus upon 
mathematical artifacts associated with contrived example data sets. Wilkinson 
affirms that when analyzing real data, PCA and CFA generally do not differ 
enough to matter. 

Others argue for the preference of PCA over CFA on the basis of 
parsimony. PCA is more mathematically direct and simple, thus it requires 
less computational energy (and less consumption of computer processing units). 
Gorsuch (1990), however, points out that elegance in scientific models does
not necessarily always mean mathematical simplicity. 

Gorsuch (1988) and Snook and Gorsuch (1989) note that preference of PCA 
over CFA, or vice versa, is often a matter of paradigmatic biases of the
researcher. PCA, notes Gorsuch (1988), follows a mathematical paradigm, i.e., 
it stresses derivation and exact computation of all procedures. CFA, on the 
other hand, follows a scientific, or statistical, paradigm. This latter 
paradigm stresses the value of estimation and replication of results across 
various samples. Preference for the CFA model is noted by McArdle (1990), who
states, "The PCA always overestimates the true loading, and the accuracy of 
PCA becomes very poor as the true loadings get smaller. In these math models 
the CFA algorithms are always 100% accurate." 

Empirical Comparisons of the Two Methods

A number of researchers have sought to empirically test the convergence 
of factor solutions offered by PCA and CFA. A number of these studies (e.g., 
Acito & Anderson, 1980; Borgatta, Kercher, & Stull, 1986; Linn, 1968; Snook &
Gorsuch, 1989; Velicer, Peacock, & Jackson, 1982) have involved the factor
analyzing of Monte Carlo generated data. Acito and Anderson (1980)
compared principal components analysis with two common factoring techniques
(image and alpha factoring) using generated data sets of 50, 100, and 300
cases. The results suggested that image and alpha analyses were superior to 
principal components analysis in reproducing known factor patterns among 
variables. In fact, the Acito and Anderson analyses suggest that image 
analysis with 50 cases is almost as accurate as principal components analysis 
with 300 cases. In a similar study, Snook and Gorsuch (1989) found that
common factor analysis reproduced known factor structures more accurately than
principal components analysis when factor patterns identified with generated
data were compared with population factor patterns. 

Other comparative studies have involved actual data sets. For instance, 
Velicer (1977) compared two CFA techniques with PCA by reanalyzing data from 
nine studies. All the studies employed rather large sample sizes and large 
numbers of variables. Velicer's comparisons were based on observing
similarities and differences in the factor patterns across the methods and on 
a summary statistic g based on summation of the squared differences in 
comparable factor structure coefficients across the methods. Velicer 
concluded that the patterns produced across the methods were remarkably 
similar, with the greatest differences noted in the last (weakest) factor
extracted. A similar comparison study by Wilkinson (1989) confirms the 
Velicer findings. Wilkinson concludes that most differences between CFA and 
PCA are the result of "overfactoring" of data, i.e., extracting of too many
meaningless factors (p. 456). 
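
A summary statistic of the kind Velicer describes can be sketched as follows. The exact form of his statistic is not reproduced here; the function below assumes a plain sum of squared differences between matched coefficients, and its name is hypothetical. The example values are the Factor I coefficients for Items 9, 18, 14, 16, and 26 listed in Table 4.

```python
import numpy as np

def squared_difference_sum(a, b):
    """Sum of squared differences between comparable structure coefficients."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    if a.shape != b.shape:
        raise ValueError("coefficient arrays must have the same shape")
    return float(np.sum((a - b) ** 2))

# Factor I coefficients for Items 9, 18, 14, 16, and 26 (Table 4).
pca = [0.86930, 0.83911, 0.82799, 0.77404, 0.53371]
paf = [0.84517, 0.82333, 0.80383, 0.76097, 0.47375]

g = squared_difference_sum(pca, paf)
print(round(g, 6))
```

A value near zero indicates that the two extraction methods assign nearly the same coefficients to the matched items; the comparison is meaningful only when factors have first been matched and oriented in the same direction.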

Hubbard and Allen (1987) also compared PCA with CFA using actual data, 
but their findings were in sharp contrast to those of Velicer and Wilkinson.
Hubbard and Allen provide empirical evidence across several studies that
illustrates differences in the magnitudes of the factor structure coefficients
using PCA and CFA. In several of the cases, these shifts in coefficients lead
to differences in the interpretation of the factors. Similarly, Gorsuch
(198*' illustrated several factor analytic cases using a small number of 
variables in which the PCA and CFA methods yielded notably different results. 

Without a doubt, the issue of which of these methods, if either, is
superior to the other is far from being resolved. Opponents of PCA have often 
used either hypothetical data or else have used a rather small number of 
variables as weapons in their factor analytic arsenal. It could be argued 
that either of these types of data may lead to distortion of factor analytic 
results. On the other hand, PCA advocates often try to explain away the CFA 
advocates' arguments by suggesting that factor indeterminacy is responsible 
for most of the differences in the two methods. 

Consequently, an empirical example is offered as a part of the present
study in order to review and further explore these differences. An actual 
data set (Daniel & Okeafor, 1987) involving teachers' perceptions about their 
own professionalism on a "logic of confidence" scale was factor analyzed using 
both PCA and CFA. This data set serves as an interesting factor analytic 
example as attitudinal measures are frequently used in educational research. 

Method 

The data utilized in the following analyses were collected from 70 
preservice and inservice teachers using a three-subscale "logic of confidence" 
measure authored by Okeafor, Licata, and Ecker (1987). This self-evaluation 
measure included 29 Likert-type items. The illustrated data set borders on 
being a "problem" data set for factor analytic purposes as the number of 
variables and subjects are below minimal criteria as established by Gorsuch
(1983) and Comrey (1973). 

Responses of the 70 subjects on the logic of confidence measure were 
factor analyzed using principal components analysis and principal axis factor 
analysis. Analyses were run using the SPSSx FACTOR procedure. Two alternate 
rules were used for determining the number of factors to extract in comparing 
the results of the analyses. The "eigenvalue greater than unity" rule
(Guttman, 1954) was selected as it is a commonly used rule and as it is the
default factor extraction method for many statistical software programs such
as SPSSx. In a second round of analyses, Cattell's (1966) "scree" test was
utilized for determining the number of factors. The scree test relies on a 
visual plot of the eigenvalues, with the number of factors to extract 
determined by a break in the curve of the line formed by the plotted values. 
Generally, scree is a more conservative test, suggesting that response 
variance can be adequately accounted for in relatively few factors. All 
results were rotated to simple structure using the varimax criterion. 
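
The two number-of-factors rules can be illustrated on a small hypothetical correlation matrix. The matrix below is invented for the example (two clusters of three items, with r = .50 within a cluster and r = .00 between clusters) and is unrelated to the logic of confidence data; the variable names are likewise assumptions.

```python
import numpy as np

# Hypothetical correlation matrix: two clusters of three items each.
block = np.full((3, 3), 0.5)
np.fill_diagonal(block, 1.0)
R = np.block([
    [block, np.zeros((3, 3))],
    [np.zeros((3, 3)), block],
])

# Eigenvalues sorted from largest to smallest.
eigvals = np.sort(np.linalg.eigvalsh(R))[::-1]

# "Eigenvalue greater than unity" rule: count the eigenvalues above 1.0.
n_factors = int(np.sum(eigvals > 1.0))

# Drops between successive eigenvalues. A scree plot displays the same
# information visually, and the analyst looks for the break in the curve.
drops = -np.diff(eigvals)

print(np.round(eigvals, 3))
print(n_factors)
print(np.round(drops, 3))
```

In this contrived case the two rules agree, since the only large drop follows the second eigenvalue; with real data the scree break is a visual judgment and often suggests fewer factors than the eigenvalue rule.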

Findings 

Principal components and principal axis factor analyses using the default 
"eigenvalue greater than unity" extraction rule were run for the data at hand. 
Ten factors were extracted for each solution. Initial variable communality 
estimates for the principal factor solution were generally high, with only one 
of the 29 values falling into the "low" (< .40) range as defined by Gorsuch 
(1983), and with nearly all of the communalities (23 of 29) above .50. 

Velicer (1977), Snook and Gorsuch (1989), and Hubbard and Allen (1987) 
propose several statistics by which various factor patterns may be compared. 
However, the ability of a model to identify a viable factor structure is the
most stringent test of a model, and, indeed, the aspect of the model of most 
interest to researchers employing factor analysis in empirical research. 
Hence, the more highly statistical means of comparing the two models were 
rejected for use in the present study in lieu of direct comparison of 
identified factor structures using the "eyeball" strategy. More specifically,
the two models were evaluated and compared on the degree to which they 
produced factors which were uniquely saturated with given items. For the 
purposes of these comparisons, an item was considered uniquely associated 
with a factor if (a) it had a higher factor structure coefficient on that 
factor than on any other factor, and (b) its coefficient on that factor 
exceeded |.40|.
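
The two-part criterion can be written as a short routine. The sketch below is an illustration rather than the procedure actually used: it compares coefficients by absolute value (an assumption, in keeping with the |.40| cutoff), and the function name is hypothetical. It is applied to the Table 3 coefficients from the heuristic example, since the full 29-item matrices are not reproduced in the text.

```python
import numpy as np

def unique_items(loadings, cutoff=0.40):
    """Map each factor (numbered from 1) to its uniquely associated items."""
    magnitude = np.abs(np.asarray(loadings, dtype=float))
    assigned = {factor + 1: [] for factor in range(magnitude.shape[1])}
    for item, row in enumerate(magnitude, start=1):
        best = int(np.argmax(row))     # criterion (a): highest coefficient
        if row[best] > cutoff:         # criterion (b): exceeds the cutoff
            assigned[best + 1].append(item)
    return assigned

# Structure coefficients from Table 3 (rows are Items 1 through 10).
table3 = [
    [ 0.63649, -0.09264],
    [ 0.12548,  0.71718],
    [ 0.23546,  0.59894],
    [-0.09758,  0.59066],
    [ 0.80878,  0.09956],
    [-0.83154,  0.21605],
    [ 0.76894,  0.10756],
    [ 0.02084,  0.66631],
    [ 0.84367,  0.13154],
    [-0.24024,  0.66871],
]

print(unique_items(table3))
```

Lowering the cutoff argument (for example, to .30) shows how many additional items would be interpreted under a more lenient saliency criterion.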

The "unique" items associated with these factors, along with the items'
factor structure coefficients, are presented in Table 4. The two analyses
yielded nearly identical results up through the fourth factor. Factors I,
III, IV, and VI for both analyses included the exact same sets of items, and
Factors II and V varied by only one item each across the two solutions.
Factors VIII and IX for the PCA analysis matched Factors VII and VIII,
respectively, for the PAF analysis, and PCA Factor X matched PAF Factor IX
with the exception of the former including one additional item. What is most 
notably different in the two analyses is the magnitude of the factor structure 
coefficients. 

Also, each analysis yielded one rather unique factor. The most unique
PCA factor was Factor VII, while Factor X was the most unique PAF factor. It
could be effectively argued, however, that the last five factors on both
analyses are hardly interpretable as most of them correlated adequately with
only one or two items. These results suggest a problem with overextraction. 

The "scree" test indicated a clear break between Factors III and IV;
hence, a three-factor solution was run using each of the extraction methods.
The resultant PCA and PAF factor matrices are presented in Tables 5 and 6,
respectively. The factor structures are very similar. With the exception of
Item 20, each of the items correlated most highly with the same factor across
the two analyses. Again, the most notable difference between the two matrices
was the magnitude of the structure coefficients. Since researchers rarely
interpret items as defining a construct when structure coefficients are less
than |.30|, a number of items might be omitted from interpretation if
employing the PAF extraction method in the present research situation. 

Discussion 

The results of the foregoing analyses tend to confirm the assumptions (a) 
that the choice between PCA and CFA generally does not make a lot of
difference, and (b) that differences shown by the two methods tend to be based
on overextraction. The comparative ten-factor solutions using the two methods
produced relatively similar results up until the seventh factor. By that
point in the analyses, none of the remaining factors in either analysis was
uniquely saturated with more than two items, making interpretation of these
factors almost meaningless. Hence, the results would indicate that the
"eigenvalue greater than unity" rule led to overextraction in both models. 

The more viable three-factor solutions also yielded very similar results, 
suggesting that either method might yield an acceptable factor solution for 
the data at hand. However, these analyses illustrate that the magnitudes of
the structure coefficients may be affected by the choice of extraction
method. In the present example, using an item saliency criterion of |.30|,
the researcher would interpret seven more items with PCA than with PAF.

Thus, the choice of CFA or PCA is largely unimportant in terms of the
models' abilities to produce factor structures, but may lead to differences in
interpretation of results. However, the data employed in the present analysis
may not be typical of data encountered in educational research. Despite the 
problems associated with the size of the sample and to some extent the number 
of variables in the set, the variables' communalities were relatively high, 
which may have accounted for the similarity of results across the two methods. 

When employing a data set with low variable communalities, the researcher 
is generally advised to use CFA over PCA. However, it might be questionable
whether the researcher should use any type of factor analytic technique if a
substantial number of the variables have low communalities. Communalities
provide lower-bound estimates of the reliability of the variables (Cattell,
1978). Thus, if many of these variables have low communalities, the integrity
of the instrument(s) being used is already at stake, and rather than
representing viable constructs, the resulting factors are likely to be no more 
than mathematical artifacts. 

Generally, CFA is the safer of the two techniques. This is especially 
true in cases in which there are only a few variables or in which variable
communalities are poor. Nevertheless, the researcher would be remiss to
blindly employ CFA without examining the variable communalities. Despite the 
fact that the two methods generally yield very similar results, "common 
factor analysis provides a fail-safe procedure in terms of more accurately 
reflecting the population factor pattern because it gives accurate results 
when component analysis produces biased results as well as when both methods 
produce reasonably accurate results" (Snook & Gorsuch, 1989, p. 153). 

Table 1

Hypothetical Data Set for Factor Analysis Example

CASE  ITEM1  ITEM2  ITEM3  ITEM4  ITEM5  ITEM6  ITEM7  ITEM8  ITEM9  ITEM10

  1     2      1      1      1      2      1      1      1      2      1
  2     2      2      2      2      1      2      2      1      2      2
  3     2      1      2      1      2      1      2      1      2      1
  4     1      1      2      1      1      2      1      1      1      2
  5     2      1      2      1      2      1      2      1      2      2
  6     1      2      2      2      2      1      2      2      2      2
  7     2      1      2      1      2      1      2      2      2      2
  8     2      1      1      1      1      2      1      1      2      1
  9     2      1      2      2      2      2      1      1      1      2
 10     2      1      2      1      2      2      1      2      2      2
 11     2      1      2      1      2      2      1      2      2      2
 12     1      1      1      1      1      2      1      1      1      2
 13     2      1      2      1      2      1      2      1      2      2
 14     2      1      2      1      1      2      1      1      1      2
 15     2      1      2      1      2      1      2      1      2      2
 16     2      1      1      2      2      1      2      1      2      2
 17     2      1      2      2      1      2      1      2      2      2
 18     1      1      1      2      1      2      1      1      1      2
 19     2      2      2      2      2      2      1      2      2      2
 20     1      1      2      1      1      2      1      2      1      2



Table 2

Inter-Item Correlation for Table 1 Data

            ITEM1    ITEM2    ITEM3    ITEM4    ITEM5    ITEM6    ITEM7    ITEM8    ITEM9

ITEM2     -.08085
ITEM3      .20000   .24254
ITEM4     -.06052   .57248  -.06052
ITEM5      .47140   .05717   .23570  -.04280
ITEM6     -.23570   .05717   .00000   .17118  -.66667
ITEM7      .23570   .22866   .23570   .04280   .45833  -.79167
ITEM8     -.06052   .27890   .42366   .12088   .17118   .17118  -.17118
ITEM9      .62994   .27501   .12599   .02288   .57907  -.53452   .53452   .25163
ITEM10    -.24254   .17647   .40423   .30826  -.05717   .22866   .05717   .30826  -.27501
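
The entries of Table 2 are ordinary Pearson correlations computed over the 20 cases of Table 1. The following is a minimal sketch of that computation, not the procedure the author necessarily used. The names `DATA`, `pearson`, and `lower_triangle` are hypothetical, and the ITEM2 column is treated as an assumption (cases 2, 6, and 19 scored 2, all others 1), an assignment that reproduces the ITEM2 correlations reported in Table 2.

```python
from math import sqrt

# Responses of the 20 cases in Table 1 (1/2 coding as printed).
# ITEM2 is an assumed column chosen to match the ITEM2 correlations in Table 2.
DATA = {
    "ITEM1":  [2, 2, 2, 1, 2, 1, 2, 2, 2, 2, 2, 1, 2, 2, 2, 2, 2, 1, 2, 1],
    "ITEM2":  [1, 2, 1, 1, 1, 2, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 1],
    "ITEM3":  [1, 2, 2, 2, 2, 2, 2, 1, 2, 2, 2, 1, 2, 2, 2, 1, 2, 1, 2, 2],
    "ITEM4":  [1, 2, 1, 1, 1, 2, 1, 1, 2, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 1],
    "ITEM5":  [2, 1, 2, 1, 2, 2, 2, 1, 2, 2, 2, 1, 2, 1, 2, 2, 1, 1, 2, 1],
    "ITEM6":  [1, 2, 1, 2, 1, 1, 1, 2, 2, 2, 2, 2, 1, 2, 1, 1, 2, 2, 2, 2],
    "ITEM7":  [1, 2, 2, 1, 2, 2, 2, 1, 1, 1, 1, 1, 2, 1, 2, 2, 1, 1, 1, 1],
    "ITEM8":  [1, 1, 1, 1, 1, 2, 2, 1, 1, 2, 2, 1, 1, 1, 1, 1, 2, 1, 2, 2],
    "ITEM9":  [2, 2, 2, 1, 2, 2, 2, 2, 1, 2, 2, 1, 2, 1, 2, 2, 2, 1, 2, 1],
    "ITEM10": [1, 2, 1, 2, 2, 2, 2, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2],
}


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def lower_triangle(data):
    """Correlations below the diagonal, keyed by (row item, column item)."""
    names = list(data)
    return {
        (names[i], names[j]): pearson(data[names[i]], data[names[j]])
        for i in range(1, len(names))
        for j in range(i)
    }


if __name__ == "__main__":
    for (row, col), r in lower_triangle(DATA).items():
        print(f"{row:>6} {col:>6} {r: .5f}")
```

Because every item is scored 1 or 2, each of these coefficients is also a phi coefficient, which is why several pairs of items with the same marginal splits share identical values.
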
Table 3

Factor Solution Matrix for Factor Analysis Example*

            FACTOR1    FACTOR2

ITEM1        .63649    -.09264
ITEM2        .12548     .71718
ITEM3        .23546     .59894
ITEM4       -.09758     .59066
ITEM5        .80878     .09956
ITEM6       -.83154     .21605
ITEM7        .76894     .10756
ITEM8        .02084     .66631
ITEM9        .84367     .13154
ITEM10      -.24024     .66871

* These factors were extracted using the principal components method and
rotated to the varimax criterion.
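
Since the text recommends examining variable communalities before settling on an extraction method, it may help to see how a communality is obtained from a loading matrix such as Table 3: it is the sum of an item's squared loadings across the retained factors. The sketch below is illustrative only; `LOADINGS`, `communality`, and `variance_by_factor` are hypothetical names, and because Table 3 is a principal components solution, the values describe variance reproduced by the two retained components rather than common-factor communalities.

```python
# Rotated loadings copied from Table 3 (FACTOR1, FACTOR2).
LOADINGS = {
    "ITEM1":  (0.63649, -0.09264),
    "ITEM2":  (0.12548, 0.71718),
    "ITEM3":  (0.23546, 0.59894),
    "ITEM4":  (-0.09758, 0.59066),
    "ITEM5":  (0.80878, 0.09956),
    "ITEM6":  (-0.83154, 0.21605),
    "ITEM7":  (0.76894, 0.10756),
    "ITEM8":  (0.02084, 0.66631),
    "ITEM9":  (0.84367, 0.13154),
    "ITEM10": (-0.24024, 0.66871),
}


def communality(row):
    """Sum of squared loadings for one item across the retained factors."""
    return sum(value * value for value in row)


def variance_by_factor(loadings):
    """Sum of squared loadings down each factor column."""
    n_factors = len(next(iter(loadings.values())))
    return [
        sum(row[j] ** 2 for row in loadings.values())
        for j in range(n_factors)
    ]


if __name__ == "__main__":
    # List items from lowest to highest communality.
    for item, row in sorted(LOADINGS.items(), key=lambda kv: communality(kv[1])):
        print(f"{item:<7} h2 = {communality(row):.5f}")
    print("variance by factor:", [round(v, 5) for v in variance_by_factor(LOADINGS)])
```

For ITEM1, for example, the result is .63649 squared plus .09264 squared, or roughly .414. An orthogonal rotation such as varimax leaves these row sums unchanged, so the same check can be run on either the unrotated or the rotated matrix.
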
Table 4

Items Associated with Factors Across Two Ten-Factor Solutions*

               PCA extraction             PAF extraction
Factor      Item  (Struc. Coeff.)      Item  (Struc. Coeff.)

I             9   ( .86930)              9   ( .84517)
             18   ( .83911)             18   ( .82333)
             14   ( .82799)             14   ( .80383)
             16   ( .77404)             16   ( .76097)
             26   ( .53371)             26   ( .47375)

II            6   ( .82072)             22   ( .77825)
             22   ( .75780)              6   ( .69581)
             13   (-.57330)             28   ( .53028)
             24   ( .55490)             24   ( .48000)
             28   ( .51539)             25   (-.42278)

III           7   ( .77423)              7   ( .74430)
             23   ( .77226)             23   ( .73915)
             27   ( .62846)             10   ( .61223)
             10   ( .62814)             27   ( .42985)

IV            8   ( .70444)              5   ( .78667)
              5   ( .66238)              8   ( .48972)
             20   ( .54554)             20   ( .48163)

V            25   ( .70828)             11   ( .59110)
             19   ( .63434)             17   ( .561??)
             17   ( .60123)             19   ( .43543)

VI            3   ( .80552)              3   ( .58259)
             12   ( .41247)             12   ( .47724)

VII          11   ( .78386)             21   ( .71080)
              1   ( .54778)

VIII         21   ( .83544)             15   ( .83399)

IX           15   ( .89427)              2   ( .59057)
                                         4   ( .46328)

X             4   ( .74118)             13   ( .73277)
              2   ( .54477)
             29   ( .48707)

* Items are listed under each factor in descending order of the absolute
values of their factor structure coefficients.
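
One way to read Table 4 is to ask, for each PCA factor, which PAF factor contains most of the same items. The sketch below does this with a Jaccard overlap (shared items divided by all items in either group). The overlap measure and the names `PCA`, `PAF`, `jaccard`, and `best_matches` are assumptions introduced for illustration and are not part of the original analysis; the item groupings are those read from Table 4.

```python
# Item groupings read from Table 4, keyed by factor numeral.
PCA = {
    "I": {9, 18, 14, 16, 26},
    "II": {6, 22, 13, 24, 28},
    "III": {7, 23, 27, 10},
    "IV": {8, 5, 20},
    "V": {25, 19, 17},
    "VI": {3, 12},
    "VII": {11, 1},
    "VIII": {21},
    "IX": {15},
    "X": {4, 2, 29},
}
PAF = {
    "I": {9, 18, 14, 16, 26},
    "II": {22, 6, 28, 24, 25},
    "III": {7, 23, 10, 27},
    "IV": {5, 8, 20},
    "V": {11, 17, 19},
    "VI": {3, 12},
    "VII": {21},
    "VIII": {15},
    "IX": {2, 4},
    "X": {13},
}


def jaccard(a, b):
    """Shared items divided by all items appearing in either set."""
    return len(a & b) / len(a | b)


def best_matches(first, second):
    """For each factor in `first`, the factor in `second` with the largest overlap."""
    result = {}
    for name, items in first.items():
        partner = max(second, key=lambda other: jaccard(items, second[other]))
        result[name] = (partner, jaccard(items, second[partner]))
    return result


if __name__ == "__main__":
    for name, (partner, overlap) in best_matches(PCA, PAF).items():
        print(f"PCA {name:<5} ~ PAF {partner:<5} overlap = {overlap:.2f}")
    listed_pca = set().union(*PCA.values())
    listed_paf = set().union(*PAF.values())
    print("listed under PCA only:", sorted(listed_pca - listed_paf))
```

Under this measure, six of the ten PCA factors have an identically composed PAF counterpart (in some cases under a different factor numeral), while items 1 and 29 are listed under a PCA factor but under no PAF factor.
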
Table 5

Principal Components Three-Factor Rotated Solution Matrix

             FACTOR1     FACTOR2     FACTOR3

MYSELF1       .14288     -.52065*     .17304
MYSELF2       .41299*     .17433      .01133
MYSELF3       .31451*     .14906     -.00643
MYSELF4       .30699*     .08766     -.03429
MYSELF5       .02536      .66790*     .13844
MYSELF6       .10650      .09884      .69682*
MYSELF7       .12625      .61145*    -.09756
MYSELF8      -.12928     -.30911*     .02487
MYSELF9       .83499*    -.03698     -.10900
MYSELF10      .18936     -.75860*    -.09147
MYSELF11     -.16907      .33500*    -.11529
MYSELF12      .16244     -.02702     -.32851*
MYSELF13     -.07218      .22922     -.44816*
MYSELF14      .83174*    -.13026     -.00943
MYSELF15      .24205*    -.10032     -.00679
MYSELF16      .64798*    -.04798     -.09800
MYSELF17     -.24436     -.50795*     .26963
MYSELF18      .79585*    -.02196     -.31655
MYSELF19     -.09399     -.25246      .32288*
MYSELF20     -.22954      .43853      .43989*
MYSELF21     -.08753      .18514      .36532*
MYSELF22     -.03275     -.01547      .80000*
MYSELF23      .19866      .65412*     .22517
MYSELF24     -.28917     -.12307      .59293*
MYSELF25      .07943     -.43193      .47359*
MYSELF26      .59723*    -.22852     -.33556
MYSELF27     -.03055      .51746*    -.03223
MYSELF28      .28482      .17814     -.58234*
MYSELF29     -.02016      .37165*     .?6860

Note: Starred values indicate the factor with which the item is most highly
correlated.
Table 6

Principal Axis Three-Factor Solution Matrix

             FACTOR1     FACTOR2     FACTOR3

MYSELF1       .09660     -.46131*     .13520
MYSELF2       .31552*     .13943     -.01636
MYSELF3       .24243*     .11808     -.02390
MYSELF4       .23112*     .06110     -.04882
MYSELF5       .00871      .60715*     .11227
MYSELF6       .05638      .08050      .62719*
MYSELF7       .10974      .55560*    -.09600
MYSELF8      -.11300     -.26093*     .02463
MYSELF9       .83380*    -.03624     -.08566
MYSELF10      .17546     -.74746*    -.08782
MYSELF11     -.12437      .27625*    -.07385
MYSELF12      .14578     -.02774     -.26842*
MYSELF13     -.02534      .19659     -.35905*
MYSELF14      .81483*    -.13213      .00375
MYSELF15      .18976*    -.08857     -.01629
MYSELF16      .57658*    -.04487     -.11001
MYSELF17     -.22177     -.45467*     .24037
MYSELF18      .80470*    -.01779     -.30109
MYSELF19     -.09070     -.21178      .25972*
MYSELF20     -.21556      .39585      .40198*
MYSELF21     -.10509      .15187      .28572*
MYSELF22     -.04702     -.02125      .78626*
MYSELF23      .17001      .61020*     .20555
MYSELF24     -.28994     -.11789      .53214*
MYSELF25      .04221     -.39546      .41046*
MYSELF26      .54865*    -.21093     -.31670
MYSELF27     -.02054      .45023*    -.02123
MYSELF28      .27635      .16718     -.52987*
MYSELF29     -.04701      .31418*     .29613

Note: Starred values indicate the factor with which the item is most highly
correlated.